A flexible front-end for HTS
نویسندگان
چکیده
Parametric speech synthesis techniques depend on full context acoustic models generated by language front-ends, which analyse linguistic and phonetic structure. HTS, the leading parametric synthesis system, can use a number of different front-ends to generate full context models for synthesis and training. In this paper we explore the use of a new text processing front-end that has been added to the speech recognition toolkit Kaldi as part of an ongoing project to produce a new parametric speech synthesis system, Idlak. The use of XML specification files, a modular design, and modern coding and testing approaches, make the Idlak front-end ideal for adding, altering and experimenting with the contexts used in full context acoustic models. The Idlak front-end was evaluated against the standard Festival front-end in the HTS system. Results from the Idlak front-end compare well with the more mature Festival front-end (Idlak 2.83 MOS vs Festival 2.85 MOS), although a slight reduction in naturalness perceived by non-native English speakers can be attributed to Festival’s insertion of non-punctuated pauses.
منابع مشابه
Multilingual TTS System of Nokia Entry for Blizzard 2010
In Nokia’s blizzard 2010 entry, we built the system with Nokia multilingual text to speech front end system and two high performance HTS backends. This MLTTS front end system describes the design and implementation designed for universal language coverage and a single code execution for them all based on the assumption that there are more features uniting world languages than differentiating them.
متن کاملIdlak Tangle: An Open Source Kaldi Based Parametric Speech Synthesiser Based on DNN
This paper presents a text to speech (TTS) extension to Kaldi a liberally licensed open source speech recognition system. The system, Idlak Tangle, uses recent deep neural network (DNN) methods for modelling speech, the Idlak XML based text processing system as the front end, and a newly released open source mixed excitation MLSA vocoder included in Idlak. The system has none of the licensing r...
متن کاملDevelopment of a bycatch reduction device (BRD) for shrimp beam trawl using flexible materials
This study aimed to design a bycatch reduction device (BRD) for shrimp beam trawl, which is manufactured by flexible materials to reduce bycatch for the gear in the South Sea of Korea. The model test was carried out to understand the shape of the gear in the water and to measure the variation of flow speed due to the BRD in a circulating water channel. Catches were compared between a shrimp b...
متن کاملOptimization of an HTS Induction/Synchronous Motor According to Changing of HTS Tapes Critical Current by Analytical Hierarchy Process
This paper represents the performance of a squirrel-cage High Temperature Superconducting Induction/ Synchronous Motor (HTS-ISM) based on nonlinear electrical equivalent circuit. The structure of the HTS-ISM is the same as that of the squirrel-cage type induction machine, and the secondary windings are fabricated by the use of the HTS wires. It has already been shown that based on the experimen...
متن کاملDevelopment of a bycatch reduction device (BRD) for shrimp beam trawl using flexible materials
This study aimed to design a bycatch reduction device (BRD) for shrimp beam trawl, which is manufactured by flexible materials to reduce bycatch for the gear in the South Sea of Korea. The model test was carried out to understand the shape of the gear in the water and to measure the variation of flow speed due to the BRD in a circulating water channel. Catches were compared between a shrimp b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014